Zhengyuan Yang

Residual Decoder Adapter: ID-Preserving Tokenizer Adaption for Autoregressive Text Rendering

Jun 01, 2026
Via arXiv

Planning with the Views via Scene Self-Exploration

May 28, 2026
Via arXiv

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

May 19, 2026
Via arXiv

TextGround4M: A Prompt-Aligned Dataset for Layout-Aware Text Rendering

Apr 27, 2026
Via arXiv

MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation

Apr 16, 2026
Via arXiv

FlowInOne: Unifying Multimodal Generation as Image-in, Image-out Flow Matching

Apr 08, 2026
Via arXiv

RAGEN-2: Reasoning Collapse in Agentic RL

Apr 07, 2026
Via arXiv

BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation

Mar 26, 2026
Via arXiv

RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

Feb 02, 2026
Via arXiv

ProImage-Bench: Rubric-Based Evaluation for Professional Image Generation

Dec 13, 2025
Via arXiv